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A NEW TEST OF THE STATISTICAL NATURE OF THE BRIGHTEST CLUSTER GALAXIES 

Yen-Ting Lin 1,2 ' 3 , Jeremiah P. Ostriker 1 , and Christopher J. Miller 4 

ApJ, 715, I486 

ABSTRACT 

A novel statistic is proposed to examine the hypothesis that all cluster galaxies are drawn from the same 
luminosity distribution (LD). In such a "statistical model" of galaxy LD, the brightest cluster galaxies (BCGs) 
are simply the statistical extreme of the galaxy population. Using a large sample of nearby clusters, we show 
that BCGs in high luminosity clusters (e.g., L tot >4x lO 11 /^^©) are unlikely (probability < 3 x 10~ 4 ) to 
be drawn from the LD defined by all red cluster galaxies more luminous than M r = -20. On the other hand, 
BCGs in less luminous clusters are consistent with being the statistical extreme. Applying our method to the 
second brightest galaxies, we show that they are consistent with being the statistical extreme, which implies 
that the BCGs are also distinct from non-BCG lumino us, red, cluster galaxies. We point out some issues with 
the interpretation of the classical tests proposed by Trem aine & Richstond (119771 TR) that are designed to 
examine the statistical nature of BCGs, investigate the robustness of both our statistical test and those of TR 
against difficulties in photometry of galaxies of large angular size, and discuss the implication of our findings 
on surveys that use the luminous red galaxies to measure the baryon acoustic oscillation features in the galaxy 
power spectrum. 

Subject headings: galaxies: clusters: general - galaxies: elliptical and lenticular, cD - galaxies: luminosity 
function, mass function 



1. INTRODUCTION 

Over the past decade, the features in the cosmic microwave 
background angular power spectrum and galaxy clustering 
power spectrum due to the baryon acoustic oscillations (BAO) 
have emerged to be a premier standard ruler, and strong cos- 
mological constraints have been derived us ing this technique 
(e.g. iSpergel et al.1 I2003t lEisenstein et al.1 120051: iCole et all 
l2005HPadm anabhan et al l 120071) . In the galaxy BAO mea- 
surements, the tracer population of the large scale matter dis- 
tribution often employed is the so-called luminous red galax- 
ies (LRGs). These are massive elliptical galaxies charac- 
terized by an old and passively evolving stellar population 
(lEisenstein et al.l 120011) . and tend to be the dominant, cen- 
tral galaxies in gr oup or cluster scale dark matter halos [e.g., 
Zheng et al.l2008t throughout this paper we do not refer to the 
most luminous galaxy in clusters and groups differently, but 
simply use the term brightest cluster galaxies (BCGs)]. 

Recognizing the constraining power of BAO measure- 
ments, an impressive array of on-going and planned cosmo- 
logical experiments have adopted this method as the main 
survey component (e.g., BOSS3, Wiggle20, HETDEXlT, AT- 
LAS0, ADEPT, WFMOS). An important issue faced by these 
surveys is the control of systematics, including e.g., the cor- 
rections for the galaxy bias and the redshift distortion. In 
practical terms, it is necessary to know the statistical proper- 
ties of the BAO tracer population to exquisite detail (such as 
the way they "populate" their host halos, and their luminosity 
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function). As the LRGs are used as the tracer in most of the 
aforementioned experiments, it is critical to test for the homo- 
geneity of the LRG population. In this paper we address two 
key questions: Are BCGs different from other cluster galax- 
ies? Are BCGs distinct from non-BCG, luminous, red galaxies 
in clusters and groups? 

Because of their brightness and uniformity in luminos- 
ity, the BCGs h ave long been regarded as an ideal stan- 
dard candle (e.g.. lHumason et al]ll956HScottlll957l) . Through 
the systemat ic investigation s separ a tely lead by Sandage and 
Gunn (e.g., ISandagel I1972L fl973l ISandaee & Hard^Tl973|: 
i Gunn & Okdl 19751: iKristian et al.lll978t iHoessel etal]|l980l: 
ISchneider et al] [19831) . however, it was realized that various 
corrections [such as dependences of the BCG luminosity on 
the cluster richness class (lAbelll 19581) and Bautz-Morgan type 
dBautz & Morganlll970l). and the effect of galactic cannibal- 
ism (lOstriker & Tremaindll 9751) 1 have to be applied before 
any cosmological implication from the BCG Hubble diagram 
can be extracted. 

The small dispersion in BCG l umin osity (e.g., ~ 0.3 
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whether they are simply "the brightest of the bright" or a 
different population from other cluster galaxies all together. 
If the former hypothesis were true, the BCG luminosity 
distribution (LD) simply results from sampling of an uni- 
versal LD of all cluster galaxies, and its small dispersion 
in magnitude reflects the steepness of the bright end of the 
univer sal LD. To test for such a " statistical nature" of the 
BCGs, Tremai ne & Richstond (I1977L hereafter TR) devised a 
couple of statistics based on the mean and dispersion of the 
magnitude difference between the first- and second-ranked 
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galaxies (A and cta, respectively), and the dispersion of the 
BCG magnitude (cti). The basic idea is that, if the BCGs are 
simply the statistical extreme of the cluster galaxy population, 
both (Ti and cta would be greater or comparable to A. More 
specifically, under the assumptions that (1) the numbers 
of galaxies in non-overlapping magnitude intervals are 
independent random variables, and (2) the LD drops to zero 
rapidly at the extreme bright encQ, the following conditions 
need to be satisfied: 7j = cti / A > 1 and Ti = cta /A > 0.82 
(see also Loh & Strauss 2006). 

Using LR Gs to identify dense en vironments such as groups 
and clusters, lLoh & Straussl (|2006) found a large magnitude 
gap (A ~ 0.8 mag) between the first- and second-ranked 
galaxies in ~ 12,000 LRG-selected groups and clusters (for 
which the majority of the LRGs are the BCGs). They con- 
cluded that the LRGs are inconsistent with being the statis- 
tical extreme of the universal galaxy LD, based on the tests 
proposed by TR. 

One fundamental issue faced by studies of very luminous 
galaxies, such BCGs, concerns with their luminosity. On the 
practical side, for nearby BCGs, measuring their total light 
is nontrivial. The majority of BCGs are giant elliptical or 
cD galaxies, whose surface brightness profile is typically flat- 
ter than that of normal elliptical gala xies, and can extend to 
tens or hundreds of kpc (Tonrv 1987). To measure the "total" 
magnitude requires careful subtraction of the sky background. 
On the physical side, the cD envelope may well extend into 
the intracluster space, and it is sometimes difficult to sepa- 
rate the lu minosity of the BCGs from that of the intracluster 
stars (e.g.. iGonzalez et ail 120051) . In this study we will use 
data from the Sloan Digital Sky Survey (SDSS, lYork et all 
2000), which is not deep enough to detect contributions from 
the intracluster stars. However, it is known that the current 
SDSS pipeline has difficulty handling photometry of galaxies 
with large angular extents, and therefore may seriously un- 
derestimate the luminosity of BCGs (e.g.. | Lauer et alJl2007t 
Ivon der Linden etaT] l2007t lAbazaiian et alj 120081) . In such 
cases, A will be biased low. Although corrections to BCG 
photometry (if at all possible) may decrease the value of 71, it 
may have the opposite effect on ma king the interpretation 
of the TR tests more difficult (see $4.31 1. 

Here we propose a new way to examine the statistical na- 
ture of the BCGs that is less dependent on a robust measure- 
ment of the BCG luminosity and A, and apply it to a large 
sample of nearby clusters. Our method is both conceptually 
and operationally very simple: with a sample of N clusters, 
whose total galaxy luminosities are L toU -, where i= l,2,...N, 
one can combine all member galaxies to form a composite 
cluster. By drawing galaxies in random^ from the compos- 
ite cluster to create N mock clusters whose total luminosities 
are matched to the observed ones (see Fig. [TJ, the most lumi- 
nous mock galaxies constitute the "statistical" BCG sample. 
One can then examine the distribution of the BCG luminosi- 
ties (Lbcg) for the observed and mock clusters. In particular, 
we look for deviations from the statistical expectation from 
the correlation between Lb cg and L tot . As cluster luminosity is 



This condition is solely to ensure the integrals behave well in the TR 
analysis. The "extreme bright end" can be at arbitrarily high luminosity 
(S. Tremaine 2009, private communication). Other than these two assump- 
tions, we note that the form of the LD can be very general for the TR tests to 
be applicable - the LD does not even have to be continuous. 

10 In the process of creating the N mock clusters, a galaxy can be picked 
more than once, or not selected at all. 
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FIG. 1. — An illustration of our method for obtaining the LD of all clus- 
ter galaxies, and for creating mock clusters. By combining member galaxies 
from all N clusters, we get a "galaxy pool" that by definition should be rep- 
resentative of the LD. By drawing galaxies in random from the galaxy pool 
we then create N clusters w hose total luminosities are matched as close as 
possible to the real ones (see 33 . 1 I for details). Our galaxy selection is limited 
to M r = -20. 



correlated with its mass (e.g.,|Linetal3[2004), a comparison 
of the Lbcg-^tot correlation between the observed and mock 
clusters may provide some insights into possible cluster mass 
dependence (if any) in the BCG formation mechanism(s). 

The great virtue of the TR tests is that they only rely on a 
minimal set of assumptions concerning the nature of the LD, 
and is immune to any cluster-to-cluster variation. Our pro- 
posed test, on the other hand, although relying on the univer- 
sality of the LD, is independent of the shape of the LD. In this 
sense the two approaches are highly complementary. 

In ^2] we describe our cluster sample and the galaxy data. 
We show the results of our t ests i n §|3] and present a com- 
parison with the TR tests in $3.31 We apply our t est to the 
second brightest cluster galaxies (hereafter G2s) in $3.41 Ex- 
tensive tests have been carried out in order to show the ro- 
bustness of our results (® . We conclude by discussing the 
implications of our finding on the BCG formation scenarios 
and LRG-based BAO surveys in §|5] 

Throughout this paper we adopt a flat ACDM cosmological 
model where Qm = 1 = 0.3 and Hq = 70/i7okms~ I Mpc~ 1 . 

2. CLUSTER SAMPLE AND GALAXY DATA 

Our clusters are drawn from an u pdated version of the 
C4 cluster catalog (Miller et al. 20 05|). which uses data from 
the fi fth data release of the SDSS dAdelman-McCarfhv et ail 
120071) . Of the 2037 clusters in the sample, we restrict our- 
selves to 494 that lie within the redshift range z = 0.030- 
0.077, with velocity dispersion ct > 200 km/s, and contain at 
least 2 luminous galaxies (see below). While the lower limit in 
redshift helps to alleviate problems in photometric measure- 
ments of bright galaxies with large angular extents in SDSS, 
the upper limit is chosen to ensure that nearly all galaxies with 
M r < -20 have redshifts measured by SDSS (i.e., extinction- 
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corrected pe trosian magnitude r„ < 17.77, among other selec- 
tion criteria: IStrauss et aT1l2002l) . Note that the characteristic 
magnitude for cluster red galaxies is M* >r = -21.70 (see Ta- 
ble[4]below), based on our cluster sample. In addition, as we 
will combine galaxies from all clusters to form a composite 
cluster, the small redshift range chosen prevents any redshift 
evolution within our cluster galaxy sample. 

Our first task is to assign cluster membership to galaxies in 
the SDSS main sample. For every cluster, we include only 
red galaxies with redshift |z— z c i\ < 3er/c, with r-band abso- 
lute petrosian magnitude M r < -20, and are projected within 
0.8/170 Mpc of the cluster ce nter (defined as the peak of bright 
galaxy density distribution; iMiller etldl 120051) . Here z c i and 
c are the cluster redshift and speed of light, respectively. A 
galaxy is considered red if the model color u-r > 2.2 (e.g., 
Strate vaet alj |2001). To calculate the absolute magnitudes, 
we use t he ^-correction base d on the NYU value-added galaxy 
catalog (Blanto n et al.120051) . In addition, a correction is made 
to convert the petrosian magnitudes into "total" magnitudes 
based on the surface brig htness profile of the galaxies, fol- 
lowing |&^ametin (120051) . 

On average, about 94% of galax ies in the SPSS ma in sam- 
ple have fibers assigned to them (Stra uss et al.l 120021) . This 
fraction becomes smaller in crowded fields, such as the clus- 
ters (see IMiller et"al]|2005l and below), due to the size of the 
fiber plugs. It is therefore critical to correct for such an in- 
completeness when trying to include all cluster galaxies using 
the SDSS main sample. For every cluster we first calculate 
the mean g — r and r—i colors c gr and c n of the spectroscop- 
ically confirmed member galaxies, then assume the galax- 
ies that (1) satisfy the above selection criteria (u-r > 2.2, 
r p < 17.7, projected distance < 0.8/iyQ Mpc), (2) were tar- 
geted for spectroscopy but were not assigned a fiber, and (3) 
have \g—r—c gr \ < 0.15 and \ r— i— c n -| < 0.15 as probable clus- 
ter members (hereafter referred to as the photometric mem- 
bers). We treat these galaxies exactly the same way as those 
with spectroscopic redshift measurements, and derive the ab- 
solute magnitudes by assuming they are at the cluster redshift. 

Combining the lists of the spectroscopic and photometric 
members, we record the absolute magnitudes of the galax- 
ies for each cluster. The 5980 galaxies with M r < -20 from 
all 494 clusters form our composite cluster, or a "galaxy 
pool" that by definition has the statistical galaxy LD (see ^TJ. 
Among these galaxies, 871 are photometric members (includ- 
ing 97 BCGs). 

In our analysis we will construct two cluster subsamples 
according to the total r-band luminosity L tot from the mem- 
ber galaxies: a high-luminosity sample, consisting of 124 sys- 
tems with L tot > Ldiv = 3.7 x l0 n k^LQ, and a low-luminosity 
sample with 370 systems whose L to t < L<fiv- The cluster lu- 
minosity L tot is the sum of luminosities from all red member 
galaxies (including the photometric members) more luminous 
than M r = -20, projected within a radius of 0.8/17,' Mpc. As 
the SDSS main sample has complete spectroscopic coverage 
(modulo fiber collision) for M r < -20 for all our clusters, no 
extrapolation in the luminosity function is needed to obtain 
Ltot- The division luminosity is chosen so that each sub- 
sample contains roughly equal number of galaxies. 

In Fig. |2] we show the distribution of number of galaxies 
for the two subsamples. The histograms are normalized to the 
total number of clusters in the respective samples (red/solid: 
high-luminosity; blue/dashed: low-luminosity). We see that 
> 92% of low-luminosity systems have at least 5 red galaxies 
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FIG. 2. — The red/solid and blue/dashed histograms show the normalized 
distribution of number of galaxies for the high- and low-luminosity subsam- 
ples, respectively. More than 92% of systems in the low-luminosity sample 
contain at least 5 red galaxies (more luminous than M r = —20, located within 



Mpc). 
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FIG. 3. — The red/dotted and blue/dashed histograms show the normal- 
ized redshift distribution for the high- and low-luminosity subsamples, re- 
spectively. The two samples have similar distributions. For comparison, the 
green/solid curve shows the (arbitrarily) scaled differential comoving volume 
element dV /dz. As the redshift distributions roughly follow dV /dz, our sam- 
ples are close to volume-limited. 



above our chosen magnitude limit. As we will show below, 
our results are not dependent on the faintest or most luminous 
systems. 

Fig. [3] presents the normalized redshift distribution 
dN /dz of the two subsamples (red/dotted: high-luminosity; 
blue/dashed: low-luminosity). We find that the two samples 
have similar dN /dz: the median, mean, and lcr scatter of the 
high-luminosity sample are 0.0662, 0.0624, and 0.0126, re- 
spectively. The same quantities for the low-luminosity sample 
are 0.0616, 0.0593, and 0.0126. The green/solid curve in the 
Figure is the differential comoving volume element dV /dz, 
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scaled to the height of the distributions. The similarity be- 
tween the redshift distributions and dV / dz suggests that our 
samples are roughly volume-limited, and should be represen- 
tative of the nearby groups and clusters. 

3. THE Lbcg-Ltot CORRELATION AND THE 
STATISTICAL NATURE OF BCGS 

Correlations between the luminosity of the BCGs and the 
properties of host clusters such as mass or to tal luminos- 
ity have been noted by previous studies (e .g., iLin & Mohri 
2004 lYang etail l200l Hansen et all l2007h . Here we uti- 
lize the BCG luminosity-cluster luminosity correlation (Lb cg - 
L tot ) to show that BCGs are indeed special - at least in lumi- 
nous/massive clusters. 

In Fig. @] (top panel) we show the observed Lbcg-L to t cor- 
relation for the C4 clusters. The magenta squares in the top 
panel are the mean of BCG luminosity Lb cg ,obs, as a function 
of L tot . The meaning of the cyan crosses will be described 
below. 

3.1. The Mock Clusters and BCGs 

To check for the statistical nature of BCGs, we proceed as 
follows. Based on the observed LD, we generate many real- 
izations of the Lbcg-^tot correlation, from which we derive the 
mean Lbcg-L to t relation that is expected if the BCGs are drawn 
from the same LD as other cluster galaxies. We can then com- 
pare the observed mean L\, cg -L tot relation and determine if it 
is consistent with the mean from the mock data. 

We show in the middle panel of Fig. |4]one such realiza- 
tion. Basically, for each cluster in our sample, we create a 
corresponding mock cluster by randomly drawing galaxies 
from the galaxy pool, until the total luminosity of the mock 
cluster matches that of the observed one (see also Fig. [T). 
Specifically, suppose the first N galaxies give a total lumi- 
nosity of < L t ot, and the next mock galaxy brings the to- 
tal luminosity to L^+i > L tot . We keep the 1-th galaxy 
if Ljv+i - Ltot < |Lv-L t ot|, otherwise we only use the first N 
galaxies. Because of this procedure, although the total lumi- 
nosity for massive (luminous) clusters can be matched fairly 
well, for low luminosity clusters the difference in L tot between 
the mock and real clusters can be large. 

The black squares in the middle panels of Fig. |4] show the 
mean in BCG luminosity, Lbcg.sim in this particular ensem- 
ble of simulated clusters. The cyan crosses in both the top 
and middle panels are the mean of Lbcg,sim from 200 Monte 
Carlo realizations of the mock cluster ensemble, which we 
denote as (Lbcg.sim)- It is found that for luminous clusters 
(e.g., L tot > 3.7 x lO n /!7oL ), the observed Lb C g,obs is higher 
than (Lbcg.sim) from mock clusters (e.g., compare the magenta 
squares with the cyan crosses in the top panel). For low lumi- 
nosity clusters, the observed and mock values are comparable. 

We list in Table[T]the definition of various terms employed 
in this and the following subsections. 

3.2. Statistical Significance 

By comparing the observed and mock Lbcg-L to t correlations 
we infer that BCGs in real clusters have systematically higher 
luminosities than the mock BCGs. One might easily imagine 
a systematic measurement error that systematically gives too 
large a luminosity for bright galaxies. But that would not pro- 
duce the effect we find. One would need a systematic error 
that increased the brightness of BCGs but had a less dramatic 
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FIG. 4. — The Lbcg-Loi correlation for the real clusters (top panel) and one 
of the Monte Carlo realizations of mock cluster ensemble (middle panel). 
The squares in the top and middle panel are the mean of the BCG lumi- 
nosity for the real and the mock clusters (Lbcs.obs and Lbcg.sim)- respectively. 
Also shown in both panels as crosses are the mean BCG luminosity averaged 
over 200 Monte Carlo realizations, (Lbcg.sim)- In the bottom panel we show 
the distribution of d ^ = logLb cg ,obs _ log (lbcg.sim) (blue solid points) and 

d s im = logLb C g lS im — log (lbcg.sim) (red open points). The real BCGs are sys- 
tematically more luminous than the statistical ones in luminous clusters. The 
errorbars of are derived from 200 bootstrap resampling of the observed 
Lbc g -Lot correlation. 



effect on second brightest galaxies in other (more luminous) 
clusters of the same intrinsic luminosity. Here we quantify the 
significance of the difference between the two populations. 

Using 200 realizations of the mock cluster ensemble (each 
containing 494 clusters), we calculate (Lb cg ,sim), as well as 
the difference between logarithms of the mean BCG luminos- 
ity Lbccsim from individual realizations and (Lb cg .sim), <4im = 
log Lbcg.sim -log (Lbcg.sim)- By dividing the cluster sample into 
15 bins in L tot , each containing roughly equal number of clus- 
ters, for each mock cluster ensemble we have 15 evaluations 
of the statistic c/ s ; m . One such example is shown in the lower 
panel of Fig. [4] (open red symbols). We see that a! s j m roughly 
scatters about zero. We expect that for mock clusters, the dis- 
tribution of d s i m should follow a Gaussian, which is shown as 
the blue histogram in the top panel of Fig. [5] The green curve 
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TABLE 1 
Definition of Terms 



meaning 



^-bcg.obs mean BCG luminosity of the real clusters (in 15 Lt 0l bins) 

A>cs sim mean BCG luminosity of one Monte Carlo ensemble (in 15 L tot bins) 

(^bcg.sim) average of Lb cg . s j m over 200 Monte Carlo ensembles; this is the statistical expectation 

dobs = log Lbcg.obs - log ( ^bcg.sim ) 

dmm = logibcg.sim "log (£*bcg,sim) 
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TABLE 2 

Probability of BCGs being the Statistical Extreme 
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FIG. 5. — Distribution of the statistic d, def ined as the differ ence bet ween 
logarithms of the mean BCG luminosity Lb ce and the mean of Lb cg ,sim from 
200 Monte Carlo realizations of the mock cluster ensemble. The distribution 
of the observed d„bs is shown as the blue histograms, while that for the mock 
clusters is shown as the red ones. The top, middle, and bottom panels show 
the results for the whole cluster sample (494 systems), for high luminosity 
clusters (Ltot >Ldiv = 3.7x I0 n h^Lir,; 124 systems), and for low luminosity 
clusters (L to t < L$„\ 370 systems), respectively. In each panel the green 
dotted curve is a Gaussian fit to the red histogram. For the high luminosity 
clusters the difference between the real and mock data is significant at the 
0.028% level (see Tablef2). 



is a Gaussian fit to the histogram. 

We can similarly calculate d 0DS = logLt,cg,obs — log(Lbcg,sim) 
for the real clusters. We show d \j S as a function of in 
the lower panel of Fig. |4] as the solid blue points; it is ap- 
parent that ii bs correlates positively with cluster luminosity. 
The errorbars of fi? b s are derived from 200 bootstrap resam- 
pling of the observed Lbcg-Ltot correlation. The distribution 
of Jobs is further shown as the red dashed histogram in the 
top panel of Fig. [5] and is clearly different from a Gaussian. 
A Kolmogorov-Smirnov (KS) test shows that the probabil- 
ity of d b s to be drawn from the same distribution of d s i m is 
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method for creating mock clusters 


galaxy pool Gaussian+Schechter" 


single Schechter" 


all 


494 


0.8% 0.5% 


0.6% 


high 


124 


0.03% 0.01% 


0.02% 


low 


370 


54.5% 39.0% 


44.5% 



number of clusters; the division of high and low luminosity subsamples is 



3.7 X W n h^L Q (see g). 
b see jj4.2|for more details. 



V = 0.79%. 

We examine in more detail the distribution of d in high and 
low luminosity clusters in the other two panels of Fig. [5] The 
distribution of d for the high (low) luminosity subsample is 
shown in the middle (bottom) panel. As we note above, in 
high luminosity clusters the real BCGs are on average more 
luminous than the mock ones, resulting positive d 0DS , and 
there is only a 0.028% probability that they are drawn from 
the same LD, based on a KS test. On the other hand, Lb cg ,obs 
in low luminosity clusters scatter about (Lbcg.sim}, and d b S 
and d S i m have a much higher probability (V = 54.5%) to be 
drawn from the same distribution. These results are recorded 
in Table [2] (the column und er "galaxy pool"; for meaning of 
the other columns, see j 



3.3. Tremaine-Richstone Tests 

Next we compare our results with those from the TR tests. 
Based on the whole cluster sample, we find that a\ = 0.58, 
A = 0.62, and er A = 0.49, resulting in T\ = ai/A = 0.93 and 
T 2 = cta/A = 0.78. Recall that if the BCGs are "statistical", 
we would have obtained T\ > Ti,y lm = 1 and T2 > Tijim = 0.82 
(§G~1 but see below). Note that these lower limits are de- 
rived under rather general assumptions about the form of the 
LP (TR). For examp le, assuming the underlying LD follows 
the lSchechterl (TT976) form, the limits are 7i ii m sc h « 1.16 and 

72,lim,sch ~ 0.88. 

Dividing the clusters into high and low luminosity sub- 
samples (using the same division luminosity Ldiv as in 33.21 ). 
we find (cri , A, a a , T\ , T 2 ) = (0.39, 0.55, 0.43 , 0.70, 0.79) and 
(0.54,0.65,0.50,0.84,0.77), respectively. That T x for the 
whole sample is larger than Tj for both high and low lumi- 
nosity subsamples is due to the weak, positive correlation be- 
tween L^g and L tot . 

Both our test and the TR tests indicate that, using the whole 
cluster sample, the BCGs are not drawn from the same LD 
as other cluster galaxies. Separating the low luminosity clus- 
ters from the luminous ones, our test suggests that this con- 
clusion is mainly driven by the BCGs in the luminous clus- 
ters. The 7] statistic implies a larger deviation from the 
statistical expectation for BCGs in high luminosity clusters 
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(T^iim- 7] = 0.30), compared to their counterparts in less lu- 
minous clusters (71 ii m — 71 =0.16). 

It is important to quantify how significant are the deviations 
of the Tj (i= 1 , 2) values we observe from the statistical limits. 
Without assuming any particular form of the LD, we evaluate 
the significance empirically, using the "galaxy pool" from all 
the member galaxies. More specifically, we construct ensem- 
bles of mock clusters following the method described in jj3.ll 
we can then derive the mean and dispersion of the two statis- 
tics from the distributions of 71 and T 2 based on the mock 
clusters (see Fig. [5). For all clusters in our sample, when 
the BCGs are generated statistically, the mean and standard 
deviation of the two statistics are 7j = 1.01, a Tl = 0.05, and 
%_ = 0.90, a Tl = 0.03. We find that about 3.5% of the Monte 
Carlo realizations have 71 as low as the observed value (0.93), 
but only 0.007% have T 2 as low as 0.78, the observed value. 
We denote these fractions as p\ = p\(< 71. bs) and p 2 = p 2 {< 
Tiobs)- For high luminosity clusters, (Ti,aT l ,p\,T 2 ,aT 2 ,p 2 ) = 
(0.96, 0.08, 0.014%, 0.84,0.06, 14%). For low luminosity 
ones, the values are (0.90, 0.05 , 8.8% , 0.88, 0.04, 0.07%). 

Additionally, we can estimate the distribution of the ob- 
served 7} via bootstrap resampling. The overlapping area un- 
der the "observed" and the "statistical" distributions (the lat- 
ter from the mock cluster ensembles, as mentioned above) 
then gives an estimate of the likelihood that the observed 7} 
is consistent with the statistical expectation. We denote the 
likelihood as qi = <7,(7/. b s <~) 7}, sta t), i = 1,2. For the whole 
cluster sample, we find (q\,q 2 ) = (8.8%, 1.4%), For high and 
low luminosity clusters, the results are (0.35%, 23.4%) and 
(14.9%, 0.64%), respectively. 

For clarity, these results are summarize in Table [3] We 
present the results for the whole sample, as well as the high 
and low luminosity subsamples. For each (sub)sample, the 
numbers under "obs." are the observed values, and the triplet 
of numbers under "stat." denotes the mean and standard de- 
viation of the TR statistics based on 10 5 ensembles of mock 
datasets, and pi (i = 1,2). Looking at p\ for the high and 
low luminosity clusters, one would conclude that BCGs in 
the faint clusters are much more likely to be statistical. One 
would draw the opposite conclusion if considering the T% 
statistic, however. It is therefore not clear if the TR tests give 
a consistent picture (e.g., dependence on L tot ) for the degree 
of deviation of BCGs from the galaxy popul ation in clusters. 

Using a large group sample from SDSS, Yang et al. (2008) 
examined the gap statistics. Although they did not explicitly 
calculate the values of 7}, using the results presented in their 
Fig. 7, we can roughly estimate that for clusters with virial 
mass logM « 14.8, (71, Ty ~ (1.4,0.7), and for clusters with 
logM w 14.0, (T U T 2 ) ~ (0.6,0.8). We see that the 7] values 
have opposite trends with cluster mass: based on 71 , the BCGs 
in low mass systems are special; using T 2 one would be lead 
to the opposite conclusion. 

In their study of LRG-selected groups and clus- 
ters, lLoh & Straussl (120061) found that (a u A,o- A ,71,r 2 ) = 
(0.30,0.87,0.52,0.35,0.59) at z ~ 0.12. They also examined 
the richness dependence of the TR statistics. For the rich- 
est systems (75%- 100% quartile in the richness distribution) 
they found (71 , T£) = (0.75,0.71); for those with richness in the 
25% -50% quartile (with about 2-3 galaxies on the red se- 
quenc e), (71,72) = (0.27,0.34). The results of lLoh & Straussl 
(2006) suggest a dependence of the TR statistics on richness 
that is opposite to our findings (if their richness estimator cor- 
relates well with the total luminosity). 



The main cause for the differences seems to be in the 
magnitude gap, A. Although for the richest systems, 
lLoh & Stra uss (2006) obtained A ~ 0.55, a value compara- 
ble to ours, their finding of A ~ 1 for the much poorer sys- 
tems (25% - 50% q uartile in richness) is much higher than 
our value. Because lLoh & Straussl (120061) looked for galaxy 
concentrations around LRGs using (primarily) photometric 
data, we suspect many of their low-richness systems may 
not be real, but simply chance projections of galaxies with 
sim ilarly red colors (c. f. purity of the maxBCG algorithm; 
see iKoester et al.l 120071) . On the other hand, we note that 
lYang et aTl (120081) also found that A > 1 for their low mass 
systems (logM ~ 13). 

These results seem to suggest that, when considered to- 
gether, the two TR statistics may not unambiguously deter- 
mine the statistical nature of BCGs with respect to cluster 
mass (or its proxy, such as total galaxy number or luminos- 
ity). 

The disagreements among the different studies may be at- 
tributed to several factors (such as the way BCGs are selected, 
the way background contamination is treated, or even the 
group/cluster selection). A possible resolution would be to 
compare the selection of BCG and G2 on a cluster-by-cluster 
basis, for the clusters that are present in multiple catalogs. 

3.4. Are G2s Special? 

We have applied a new method to confirm that the BCGs are 
special. This approach can be used to check if G2s are statis- 
tical, that is, are their LD consistent with that of the overall 
cluster galaxy population. The results can then be used to an- 
swer the second question we set out to address: Are BCGs 
different from other luminous, red, cluster galaxies'? 

We proceed in a similar fashion as in the previous sections. 
The main difference is that we are now comparing the ob- 
served L g 2-7tot correlation with the corresponding mean rela- 
tion from mock clusters. Requiring that mock clusters must 
have at least two members causes a small modification to the 
way mock clusters are constructed, which mainly affects the 
low luminosity clusters. Therefore we will set a lower lu- 
minosity limit at L\ lm = 1.6 x \Q n H^Lq, and define the low 
luminosity subsample (234 systems) with L\[ m < L tot < L^, 
where L& Y = 3.7 x 10 HjqLq. The 124-cluster high luminos- 
ity subsample remains the same as before. 

In Fig.|7]we show the distribution of the quantity d for G2s, 
which is analogous to Fig.|5]for the BCGs. The bottom, mid- 
dle, and top panels are the results for the low luminosity sub- 
sample, high luminosity subsample, and the combined sam- 
ple (358 systems), respectively. In all samples considered, we 
find that G2s are consistent with the statistical expectation. 
KS tests suggest that V = 26%, 55%, and 24% for all, high, 
and low luminosity clusters, respectively. Our results are not 
sensitive to the exact value of L\i m . 

This result immediately suggests that BCGs are distinct 
from G2s, and very likely, other luminous member galaxies. 

This simple exercise also demonstrates one advantage of 
our method over the TR tests: while the latter by definition 
could not determine the statistical nature of G2s, our method 
can in principle be applied to even the third- or lower-ranked 
member galaxies. 

4. SYSTEMATICS 

In this section we examine the robustness of our findings 
on various aspects in the analysis, including the selection of 
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FIG. 6. — Distribution of the T[ and T2 values for the real clusters (based on 5000 bootstrap resampling) and mock clusters (derived from 10 5 ensembles). The 
left panels show the distribution for 7\ , while the right panels are for To . From top to bottom, we show the results for all the clusters, high and low luminosity 
subsamples, respectively. In each panel, the solid histogram is the distribution of the mock clusters (with the short solid arrow indicating the mean value), and 
the dotted histogram is that for the real clusters (with the mean indicated by the short dotted arrow). The overlappingarea under the solid and dotted histograms 
represents an estimate of the likelihood the observed value is consistent with the statistical expectations (see TableQ}. Looking at the middle left panel (for T\ 
values of high luminosity clusters), one sees that the dotted histogram is at the extreme of the solid red histogram, and would conclude that BCGs are unlikely to 
be statistical (based on the T\ statistic). However, in the middle right panel (for 7?), the dotted histogram is well within the dotted blue histogram, suggesting the 
BCGs are statistical. The opposite situation happens when one looks at the bottom panels (for low luminosity clusters). 



TABLE 3 
Tremaine-Richstone Statistics 



all clusters luminous clusters faint clusters 

obsT stat. b (p.qf obs? stat." {pTqf obs. a stat." (p.qf 

7] (0.93,0.01) (1.01,0.05) (3.5,8.8) (0.70,0.01) (0.96,0.08) (0.01,0.35) (0.84,0.01) (0.90,0.05) (8.8,14.9) 

T 2 (0.78,0.01) (0.90,0.03) (0.007,0.145) (0.79,0.01) (0.85,0.06) (14,23) (0.77,0.01) (0.88,0.04) (0.07,0.64) 

a observed value and the standard deviation based on 5000 bootstrap resampling. 

k statistical expectation and the standard deviation based on 10 5 Monte Carlo realizations. 

c p is the proportion of the Monte Carlo realizations that give TR statistics as low as the observed value, q is the area of the overlapping region under the observed 
and statistical distribution of the TR values (see Fig. [6}. Both p and q are in %. 



cluster red galaxies in SDSS, the way mock cluster and galaxy 
samples are constructed, and the issue of photometry of galax- 
ies with large angular extent. 

4. 1 . Sensitivity on Galaxy and Cluster Sample Selection 

We first check the sensitivity of our results on the galaxy 
selection criteria, by repeating our test using a much more 
stringent set of conditions to assign cluster memberships for 
red galaxies. In addition to the basic requirements {u—r > 2.2, 
r p < 17.7, projected within ORh^ Mpc), for the spectroscopic 
members, we only include galaxies whose (1) redshifts satisfy 
c\z-Zd \ < v, where v = min(2er, 1500 km/s), (2) g—r and r—i 



colors fall in the range c— 1.5cr c to c + 2.5a c , where c and a c 
are the mean and dispersion of the g — r and r—i colors from 
the galaxies at redshifts broadly consistent with the cluster, 
(3) "concentration parameter" c,„ = r^/rso > 2.6, where rao 
and r$o are radii that enclose 90% and 50% of the Petrosian 
flux, respectively; this last c ondition aims to sel ect galaxies 
with early type morphology (IStrateva et al.l l2001). As for the 
photometric members, along with the filters in the r-band flux, 
u — r color, spatial distribution, and morphology, we require 
that their g—r and r—i colors to lie between c — a c and c + 
1.5cr c , where c and a c are now derived from the spectroscopic 
members. 
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FIG. 7. — Distribution of the statistic d for G2. The distribution of the ob- 
served rf (, s is shown as the blue/dashed histograms, while that for the mock 
clusters is shown as the red/solid ones. The bottom, middle, and top pan- 
els show the results for a low luminosity subsample (1.6 X l0 ll h^LQ < 
Ltot < Aiiv; 234 systems), the high luminosity subsample (L t ot > £*div = 
3.7 X 10"/i^Lq; 124 systems), and the combined sample (358 systems). 
In all samples considered, we find that G2s are consistent with the statistical 
expectation. 



These criteria reduce our cluster and galaxy sample; only 
344 clusters containing 2819 galaxies are included. We find 
that the probability V that c/ b s and d s i m are drawn from the 
same distribution is 1.5%, 0.15%, and 67% for the whole, 
high luminosity, and low luminos ity ( sub)samples. These val- 
ues confirm the results found in 33.21 

We note in that 20% (« 97/494) of the BCGs are 
photometrically identified. Some of these may be fore- 
ground/background galaxies. To evaluate the effect of pos- 
sible contamination due to these photometric BCGs, we re- 
peat our analysis using the 397 clusters whose BCGs are 
spectroscopically confirmed members (with the membership 
assignment criteria of §0. Based on the galaxy pool con- 
structed from the 3661 galaxies in these clusters, we find that 
V = 1.7%, 0.28%, and 27%, for the whole, high luminosity, 
and low luminosity (sub)samples. 

Although the absolute values of V changes somewhat with 
respect to our nominal values recorded in Table [2] the trend 
remains clear that the BCGs in high luminosity clusters are 
much less likely to be drawn from the same LD as other clus- 
ter members when compared to their counterparts in lower 
luminosity clusters. 

Let us comment next on one effect our cluster selection 
may have on the results. The requirement that the clusters 



need to host at least two galaxies with M r < -20 potentially 
excludes systems dominated by a single ~ M* galaxy, such 
as the "fossil groups" (which are defined to have A > 2 and 
extended X-ray emissions). These systems are believed to 
evolve in isolation, with last major merger with other galactic 
systems being long enough in the past that a dominant cen- 
tral ga laxy can result from the dynamic friction ( Ponm anet al.l 
1994). The BCGs in these groups would then deviate signif- 
icantly from the statistical extreme, which makes them dis- 
tinct from the other BCGs in low luminosity groups included 
in our sample. One might be concerned that the exclusion of 
fossil groups from our cluster sample may have contributed to 
our conclusion that, for the lower luminosity systems, there is 
no statistically significant deviation of BCGs from the statis- 
tical distribution of all cluster galaxies. However, given the 
rarit y of the fossils (n umbe r density ~2x 10~ 6 /iL Mpc" 3 ; 
e.g., lLa Barbera etakl 120091: IVoevodkin efafl 120091) . we ex- 
pect there to be at most ~ 20 such systems in the volume we 
sample (z = 0.03-0.077, » 5700 deg 2 ), and therefore our con- 
clusion should be robust. 

4.2. Construction of Mock Clusters 

We first investigate whether the way galaxy pool is con- 
structed affects our results. Instead of pouring galaxies from 
all clusters into one pool, we can create two pools separately 
for galaxies in high and low luminosity clusters. High and low 
luminosity mock clusters are then created from the respective 
pools. We find that V = 0.98% and 92% for the high and low 
luminosity clusters, respectively. The trend is clear that BCGs 
in low luminosity clusters are far more likely to be the statis- 
tical extreme than their cousins in high luminosity systems. 

Secondly, we note that instead of utilizing the galaxy pool, 
we can construct mock clusters using fits to the observed LD 
of cluster galaxies. The LDs for the whole cluster sample, 
and for the high and low lu minosity subsamples, are measured 
by the method developed in lLin & M ohr (2004), to which we 
refer the reader for more details. The results are shown in 
Fig. [8] From top to bottom are the LDs for the high luminosity 
clusters, the whole sample, and the low lumi nosity clusters. 

When BCGs are included, the ISchechterl function is not a 
good description of the LDs. Instead, the bright end may 
be more appropriately described by a log-normal distribution. 
Therefore, in fitting the LDs we consider both a bright end- 
truncated Schechter function plus a Gaussian, 



(p(M)dM = 



iff 4>* M Q+1 exp (-M )dM if M > M ch 



, exp 



(M-M, )• 
2<r?, 



dM 



otherwise 



(1) 



where M. = iq-° a ( m - m >)^ and a single Schechter function. The 
best-fit parameters are given in Table [4] 

Using the Gaussian+Schechter function fit to the whole 
sample as the LD, we find that for the whole, high, and low 
luminosity samples, V = 0.47%, 0.013%, and 39%, respec- 
tively. If using the Schechter function fit to the LD of the 
whole sample, V = 0.57%, 0.015%, and 44% for the three 
(sub)samples, respectively. These results are recorded in the 
third and fourth columns of Table [2] 

We thus conclude that our results are robust irrespective of 
the way the mock clusters are constructed. Another implica- 
tion from this exercise is that our method does not rely on 
largely complete spectroscopic dataset like SDSS; as long as 
(1) one can determine the LD accur ately (e.g., via a statisti- 
cal background subtraction method; iLin et al.ll2004l) . and (2) 
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TABLE 4 

Luminosity Distribution Parameters in t-band 



single Schechter Gaussian+Schechter" 

sample a Ml a ~M~* M c h ~4>l ~Ml cr L M 

all -1.01 8.53 -21.92 -0.93 10.25 -21.70 -22.40 1.38 -22.25 0.32 

high -1.17 12.62 -22.13 -0.99 18.30 -21.73 -22.60 1.90 -22.57 0.25 

low -0.93 6.34 -21.78 -0.73 8.92 -21.28 -22.05 1.35 -21.98 0.33 

a see Eq.0for the definition of the parameters. 




-24 -22 -20 
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FIG. 8. — Composite LDs in the r-band for the 494 C4 clusters at z = 
0.030 — 0.077. The red, green, and blue symbols represent the LD for high 
luminosity clusters, the whole cluster sample, and low luminosity clusters, 
respectively. The curves are Gaussian+Schechter function fits to the LD (see 
Eq.[T}; the best-fit parameters are given in Tablef4] 

the cluster membership of BCGs can be reliably assigned, our 
test can be applied. 

4.3. Sensitivity on Photometry of BCGs 

We remark in §Q] concerning the difficulty in the measure- 
ment of the total light of the BCGs. Here we examine the ef- 
fect of underestimation of BCG luminosity (and more gener- 
ally, luminosity of galaxies with large angular extents) on both 
our test and the TR tests. We address this issue by using two 
methods to obtain the best estimates of the true galaxy magni- 
tudes for our galaxy sample. The first one is a re-measurement 
of the galaxy photometry from the reduced SDSS images for 
every galaxy in our sample. The second one employs a statis- 
tical correction to the official SDSS photometry. 

Simply put, the reason that the SDSS photometric pipeline 
has difficulties dealing with the photometry of large galaxies 
is because it is defaulted to use the "local" sky flux measured 
within regions of 256 x 256 pixels (1.7' x 1.7') in the CCD 
frames. In the case that a large fraction of a given region is 
occupied by galaxies, the sky background level will be biased 
high, and in turn the galaxy magnitude will be lower than the 
true value. A remedy is to use the "global" sky level mea- 
sured in frames of 2048 x 1498 pixels (13.5' x 9.8') Follow- 
ing this line of reasoning. lvon d er Linde n et al.l (120071) devised 
a way to recover the true magnitude, given the crowdedness 
of a field, and the relative values of the local and global sky 
background. 

As our first approach, we adopt the methodology of 
Ivon der Linden et all d2007l) to apply corrections to the mag- 



nitudes of the galaxies and change the cluster luminosity ac- 
cordingly, and then use the modified galaxy catalogs (and the 
galaxy pool) to conduct our test and the TR tests. It is found 
that V = 1.2%, 0.012%, and 49%, for the whole, high lumi- 
nosity, and low luminosity cluster (sub)samples, respectively. 
As for the TR tests, for the whole sample, (Ti,p\,T 2 ,p 2 ) = 
(0.99, 11.1%, 0.77, 2 x 10" 4 ); for the high and low luminosity 
subsamples, (T u p u T 2 ,p2) = (0.68,5 x 10" 5 , 0.77, 2.4%) and 
(0.84,7.1%, 0.77, 3 x 10" 4 ), respectively. 

Using an independen t photometric reduction software, 
Hvde & B ernardil d2009) quantified the degree of underesti- 
mation of the magnitudes of large galaxies by the official 
SDSS pipeline. Their tests suggest that, for early type galax- 
ies with effective radius » 10", the mean magnitude deficit 
is about 0.25 mag, with a 68% scatter about 0.04 mag. They 
provided a fitting function that gives the mean deficit Am as 
a function of the angular size 8, which we use as the basis of 
our second method for recovering the true galaxy magnitudes. 
For every galaxy in our sample, we assume its magnitude was 
underestimated by the SDSS pipeline by Am(8) + Sm, where 
5m is a Gaussian random variate with dispersion of 0.04 mag, 
and is determined from the de Vaucouleur fits to the galaxy 
surface brightness profile (see lHvde & Berna rdi 2009 for de- 
tails). 

For the galaxy and cluster catalogs thus modified, we find 
that (7i , T 2 ) = (0.99, 0.77) for the whole sample, and the prob- 
ability to obtain T\ and T 2 from the galaxy pool as low as the 
"observed" values is 11.3% and 8 x 10~ 5 , respectively. On 
the other hand, the probability that d b s and d s i m are drawn 
from the same distribution is V = 1.2%. Breaking the sam- 
ple into high and low luminosity subsamples, the results are 
(T u p u T 2 ,p 2 ) = (0.68,5 x 10" 5 , 0.78, 2.2%) for the luminous 
clusters, and (0.84,7.5%, 0.77, 2 x 10" 4 ) for the faint clusters. 
From our test we find that BCGs in high and low luminosity 
clusters have probabilities of V = 0.012% and V = 49% to be 
statistical. 

These results suggest that both our test and the TR tests are 
insensitive to the uncertainties in galaxy photometry due to 
sky subtraction. 

It is also important to check if the low luminosity clusters 
are systematically at lower redshifts than the luminous ones. 
If this were true, the issue with sky subtraction of the SDSS 
pipeline may affect the photometry of BCGs in low luminos- 
ity clusters (hereafter LLCBCG) more strongly than that of 
BCGs in high luminosity clusters (hereafter HLCBCG), caus- 
ing the photometry of LLCBCG to be underestimated more 
than the case for HLCBCG. This would in turn affect our find- 
ing that LLCBCG are consistent wi th be ing drawn from the 
same LD as other cluster galaxies ( jj3.21 i. We have shown in 
$2]that the high and low luminosity clusters have very similar 
redshift distribution, and therefore we do not think the pho- 
tometry of LLCBCG and HLCBCG is treated differently. 
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5. DISCUSSIONS AND SUMMARY 
5.1. Implications on BCG Formation 

The main result of the present study is that BCGs as a whole 
have a LD that is distinct from that of the majority of red 
cluster galaxies (those with M r < -20). This conclusion is 
primarily due to the high luminosities of HLCBCG, which 
has only 0.03% of probability to be drawn from the LD of 
all galaxies. On the contrary, LLCBCG are more likely to be 
simply the statistical extreme of the LD of all galaxies. 

With the help of mock clusters, and the elements that come 
into the calculation of TR tests, one gains some insight into 
the BCG formation process. For high luminosity clusters, we 
find that (a u a A ,A) = (0.39,0.43,0.55) from the real data, 
while the corresponding values from the mock dataset are 
(0.42,0.37,0.44). Therefore, although u\ in real and mock 
clusters are comparable, in real clusters A is higher that that 
in mock ones (suggesting that real BCGs are on average 0.11 
mag brighter than the statistical extrme). Physics of BCG for- 
mation drives the magnitude gap to be larger than the statisti- 
cal expectation. 

To explain the origin of HLCBCG, one therefore needs 
to in v oke galactic merger s (e.g. , Ostriker & Tremaine 
1971 iHausman & Ostrikeri [1971 iLin& Mohr! 12004 
Coorav & Milosavrievicll2005t IVaTe & Ostriker 2008), which 
is naturally expected w ithin the hierarchical structure for- 
matio n paradigm (e.g., iDubinskil 1 19981 : iDe Lucia & Blaizotl 
120071) . However, the details of the mergers (e.g., major 
mergers between BCG and very luminous galaxies, minor 
mergers between BCG and M*-type galaxies, or the "galactic 
cannibalism'FT) rem ain to be understood. For example, while 
iLin & Mohrl d2004) suggested t hat major mergers are a viable 
route for forming HLCBCG, IVale & Ostrikeri d2008l) were 
more in favor of minor mergers. 

In principle, the extent of late mergers can be constrained 
by the L\, cg -L tot correlation, or statistics related to the mag- 
nitude gap between BCG and second-ran ked galaxy (e.g., 
iMilosavlievic et al.ll2006t lYang et al.ll2008h . We have devel- 
oped a simple merger model for cluster galaxies, and will 
present constraints on the importance of mergers based on the 
observations obtained in this paper in a future publication (Lin 
& Ostriker 2009, in preparation). 

After confirming that BCGs are indeed different from the 
bulk of cluster galaxy population, it is natural to ask if BCGs 
are different from other luminous/massive cluster galaxies 
(e.g., M < M» - 1). It has long been known that BCGs follow 
different scaling relations from other early type galaxies (e.g., 
Oegerle & Hoessellll991l: lLauer et ail 12007; Des roches et al.l 
20071; iBernardiet all 120071) . in the sense that the BCGs are 
larger, less dense, and have lower velocity dispersion, com- 
pared to other early type galaxies of the same luminosity. 
However, at the present it is not clear if the structure of the 
BCGs (in terms of Sersic fits) is indeed different from non- 
BC G early type gala xies of comparable color and stellar mass 
(see lGuo et all2009T) . 

We address this question somewhat indirectly; in 33.4| it is 
shown that the LD of G2s in high luminosity clusters are con- 
sistent with that of the whole cluster galaxy population. (Our 

1 1 We would like to point out that cannibalism proposed by Ostriker and 
co-workers is not binary mergers per se. It is based on the computable ten- 
dency of the most massive stellar systems to spiral to the centers of clusters 
via dynamical friction and to merge with other systems already resident at the 
center. 



method could not be robustly applied to examine the statisti- 
cal nature of G2s in low luminosity clusters, unfortunately.) 
Although we do not check the third-ranked brightest mem- 
bers, we believe a similar result will emerge for them. Effec- 
tively, we suggest that among the most luminous, red, cluster 
members, only BCGs in high luminosity clusters show sig- 
nificant deviations from the statistical extreme, and therefore 
these galaxies are a distinct population. 

Our results suggest both BCGs in low luminosity/mass 
clusters and G2s in luminous/massive clusters are the statis- 
tical extreme of the galaxy LD. As the latter are very likely 
BCGs (in clusters or groups that merge with the current host 
clusters) themselves at earlier stages of their lives, this finding 
seems to give a self-consistent picture of the luminous galaxy 
evolution within the hierarchical cluster evolution scenario. 

5.2. Cosmological Implications 

The results and reasoning presented in 33.4I & suggest 
that LRGs that are BCGs in high luminosity (massive) clusters 
are different from other LRGs (e.g., those that are LLCBCG 
and G2s or lower-ranked galaxies in high luminosity clusters), 
and call for very careful modeling and selection of the LRGs. 

Suppose the finding that HLCBCG and LLCBCG are two 
distinct populations continues to hold towards higher red- 
shifts. In a flux-limited survey, the LRGs at higher redshifts 
would be intrinsically more luminous than those at lower-z. 
Therefore, a larger fraction of LRGs at higher-z would be 
composed of HLCBCG, when compared to LRGs at lower- 
Z. When accounting for the Malmquist bias present in the 
LRG sample, one cannot simply assume that BCGs follow 
the same LD as LRGs as a whole, otherwise the inferred lu- 
minosity would be lower than the true value. This potential 
systematic effect would be larger towards higher-z. 

It is thus important to take into account the difference 
in the LD of HLCBCG and LLCBCG when creating mock 
LRG catalogs for BAO surveys. Unfortunately the BCGs in 
the present study are at lower redshifts (z < 0.1) even com- 
pared to SPSS and S DSS-II LRG studies (at z ~ 0.3; e.g., 
lEisenstein et al.l2005l) . and therefore the LDs we measure (see 
the Gaussian fits in Table |4]i are not readily applicable. Ac- 
cordingly our study strongly motivates for a systematic inves- 
tigation on the Lbcg-^tot correlation at z > 0.3, either through 
direct observations (similar to the methodology used in the 
present paper, utilizing spectroscopic data from e.g., BOSS 
or GAMA survey s), through the ha lo occupation distribu- 
tion analys is (e.g.JZheng et al.ll2008l) . or the non-parametric 
method of IVale & Ostrikeri (120061) . 

5.3. Summary 

The question of whether the luminosity distribution of 
BCGs is drawn from the same LD as other cluster galaxies has 
been extensively discussed over the last four decades. Here 
we propose a simple new test to examine the statistical nature 
of the BCGs, and to supp l ement the classical tests proposed 
bv lTremaine & Richstoneld 19771 TR). 

Our basic idea is to shuffle the cluster galaxies and see how 
likely the observed LD of BCGs can be reproduced. The pro- 
cedure is sketched in Fig. Q] Using a sample of 494 clusters 
from the C4 catalog at z = 0.03-0.077, we combine all mem- 
ber galaxies with M r < -20 to form a big pool of galaxies 
(5980 in total). We then randomly pick up galaxies from the 
pool to create 494 mock clusters, whose total luminosities L tot 
are matched to that of the real clusters as close as possible. 
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This way we can compare the correlation between the BCG 
luminosity Lb cg and L tot of real and mock clusters (see Fig.|4|i. 
Of course, there are more than one way to create an ensemble 
of mock clusters, so we repeat this Monte Carlo process to 
generate 200 mock ensembles. 

The averaged BCG luminosity from all 200 realizations 
of mock datasets, (ibcg.sim), gives the expected value when 
BCGs are statistical, as a function of cluster luminosity. The 
differences between the logarithms of mean mock BCG lu- 
minosity (Lbcg.sim) from each ensemble and (Lbcg.sim) repre- 
sent the degree of scatter expected if BCGs are stochasti- 
cally selected from a global LD. In Fig. [5] we compare the 
distribution of d sim = \ogL hcg , sim -log(L hcg , sim ) with that of 

dobs = logL b cg,ob s -log(Lbcg.sira), and find that the real BCGs 
are more luminous than the statistical expectation; there is 
0.8% of probability for d Q \, s to be drawn from the same distri- 
bution of afsim (see Table [2j. Separating the clusters into high 
and low luminosity subsamples (with a division luminosity 
ofL div = 3.7 x \0 u h^L Q ), we conclude that the difference is 
mainly coming from BCGs in high luminosity clusters (only 
0.03% chance to be statistical). The luminosities of BCGs in 
low luminosity clusters are roughly consistent with the global 
LD. 

We extend the analysis to discuss the statistical nature of 
the second brightest galaxies (G2s), and find strong evidence 
that (in high luminosity clusters), G2s have a LD similar to 
that of the whole cluster galaxy population. 

We also apply the tests proposed by TR to our cluster sam- 
ple, and record the results in Table [5] The two statistics (T\ 
and T2) both suggest a small probability for BCGs to be sta- 
tistical, consistent with our findings. However, examining the 
TR statistics for the high and low luminosity clusters reveals 
confusing trends. According to the T\ statistic, BCGs in the 
high luminosity clusters are much less likely to be statistical 
than their counterparts in the low luminosity clusters. How- 
ever, the T2 statistic suggests the opposite conclusion. 

Our results confirm previous findings that BCGs in high 
luminosity clusters are a distinct population from other cluster 
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